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Abstract 

We show that a determinant of Stirling cycle numbers counts unlabeled acyclic 
single-source automata. The proof involves a bijection from these automata to 
certain marked lattice paths and a sign-reversing involution to evaluate the deter- 
minant. 



1 Introduction The chief purpose of this paper is to show bijectively that 
a determinant of Stirling cycle numbers counts unlabeled acyclic single-source automata. 
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Specifically, let Ak{n) denote the kn x kn matrix with entry j\, where 

[*] is the Stirling cycle number, the number of permutations on [i] with j cycles. For 
example, 
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As evident in the example, Ak(n) is formed from k copies of each of rows 2 through n + 1 
of the Stirling cycle triangle, arranged so that the first nonzero entry in each row is a 1 
and, after the first row, this 1 occurs just before the main diagonal; in other words, A^{n) 
is a Hessenberg matrix with Is on the infra-diagonal. We will show 

Main Theorem. The determinant of Ak(n) is the number of unlabeled acyclic single- 
source automata with n transient states on a (k + 1) -letter input alphabet. 

Section 2 reviews basic terminology for automata and recurrence relations to count 
finite acyclic automata. Section 3 introduces column-marked subdiagonal paths, which 
play an intermediate role, and a way to code them. Section 4 presents a bijection from 
these column-marked subdiagonal paths to unlabeled acyclic single-source automata. Fi- 
nally, Section 5 evaluates det Ak(n) using a sign- reversing involution and shows that the 
determinant counts the codes for column-marked subdiagonal paths. 

2 Automata 

A (complete, deterministic) automaton consists of a set of states and an input alphabet 
whose letters transform the states among themselves: a letter and a state produce another 
state (possibly the same one). A finite automaton (finite set of states, finite input alphabet 
of, say, k letters) can be represented as a fc-regular directed multigraph with ordered edges: 
the vertices represent the states and the first, second, . . . edge from a vertex give the effect 
of the first, second, . . . alphabet letter on that state. A finite automaton cannot be acyclic 
in the usual sense of no cycles: pick a vertex and follow any path from it. This path must 
ultimately hit a previously encountered vertex, thereby creating a cycle. So the term 
acyclic is used in the looser sense that only one vertex, called the sink, is involved in 
cycles. This means that all edges from the sink loop back to itself (and may safely be 
omitted) and all other paths feed into the sink. 

A non-sink state is called transient. The size of an acyclic automaton is the number of 
transient states. An acyclic automaton of size n thus has transient states which we label 
1,2, ... ,n and a sink, labeled n + 1. Liskovets [1] uses the inclusion-exclusion principle 
(more about this below) to obtain the following recurrence relation for the number dfc(n) 
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of acyclic automata of size n on a fc-letter input alphabet (k > 1): 

n-l j v 

a fc (0) = 1; o*(n) = ^-l)^'" 1 ( ) 0' + l^'VO'), n > 1. 

i=o ^ 

A source is a vertex with no incoming edges. A finite acyclic automaton has at least 
one source because a path traversed backward V\ *— v% *— v$ *— . . . must have distinct 
vertices and so cannot continue indefinitely. An automaton is single-source (or initially 
connected) if it has only one source. Let Bk(n) denote the set of single-source acyclic 
finite (SAF) automata on a fc-letter input alphabet with vertices 1, 2, . . . , n + 1 where 1 
is the source and n + 1 is the sink, and set bk{n) = | Bk{n) |. The two-line representation 
of an automaton in Bk(n) is the 2 x kn matrix whose columns list the edges in order. For 
example, 

'111222333444555 
246666666353226 



B 



is in #3(5) and the source-to-sink paths in B include 1 2 6, 1 — > 4 3 6, 1 — > 
4 5 — > 2 -^6, where the alphabet is {a, 6, c}. 

Proposition 1. TTie number b^in) of SAF automata of size n on a k -letter input alphabet 
(n, k > 1) is given by 



6*(n) = ^(-l) T ^("_ 1 1 )(« + l)*^ 



•°a fc (i) 



Remark This formula is a bit more succinct than the the recurrence in [1, Theorem 
3.2]. 

Proof Consider the set A of acyclic automata with transient vertices [n] = {1,2, ... ,n} 
in which 1 is a source. Call 2,3, ... ,n the interior vertices. For X C [2, n], let 

f(X) = # automata in ^4 whose set of interior vertices includes X, 
g{X) = # automata in A whose set of interior vertices is precisely X. 

Then f(X) = J2 Y - xcyc[2 n] #00 an< ^ by Mobius inversion [2] on the lattice of subsets of 
[2,n], g(X) = ^2y-xcyc\2 n] A i (-^'> ^O/OO where /z(X, F) is the Mobius function for this 
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lattice. Since /j(X,Y) = (— l)' y ' ' x ' if X C Y, we have in particular that 

FC[2,n] 

Let | Y | = n — 2 so that 1 < i < n. When K consists entirely of sources, the vertices 
in [n + 1}\Y and their incident edges form a subautomaton with % transient states; there 
are a^ii) such. Also, all edges from the n — i vertices comprising Y go directly into 
[n + 1]\Y : (i + 1)^-*) choices. Thus f{Y) = (i + l) k ^ n -^a k [i). By definition, c/(0) is 
the number of automata in A for which 1 is the only source, that is, g{$) = 6&(n) and the 
Proposition now follows from (1). □ 

An unlabeled SAF automaton is an equivalence class of SAF automata under relabeling 
of the interior vertices. Liskovets notes [1] (and we prove below) that Bk{n) has no 
nontrivial automorphisms, that is, each of the (n — 1)! relabelings of the interior vertices 
of B e Bk(n) produces a different automaton. So unlabeled SAF automata of size n on 
a fc-letter alphabet are counted by j^z^bk(n). The next result establishes a canonical 
representative in each relabeling class. 

Proposition 2. Each equivalence class in Bk(n) under relabeling of interior vertices has 
size (n — and contains exactly one SAF automaton with the "last occurrences increas- 
ing" property: the last occurrences of the interior vertices — 2, 3, . . . , n — in the bottom row 
of its two-line representation occur in that order. 

Proof The first assertion follows from the fact that the interior vertices of an au- 
tomaton B e bk(n) can be distinguished intrinsically, that is, independent of their labeling. 
To see this, first mark the source, namely 1, with a mark (new label) V\ and observe that 
there exists at least one interior vertex whose only incoming edge(s) are from the source 
(the only currently marked vertex) for otherwise a cycle would be present. For each such 
interior vertex v, choose the last edge from the marked vertex to v using the built-in 
ordering of these edges. This determines an order on these vertices; mark them in order 
t>2, t>3, . . . , Vj (j > 2). If there still remain unmarked interior vertices, at least one of them 
has incoming edges only from a marked vertex or again a cycle would be present. For 
each such vertex, use the last incoming edge from a marked vertex, where now edges are 
arranged in order of initial vertex Vi with the built-in order breaking ties, to order and 
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mark these vertices Vj+i, Vj + 2, .... Proceed similarly until all interior vertices are marked. 
For example, for 

^_/lll222333444555\ 
^246666666353226/' 

Vi — 1 and there is just one interior vertex, namely 4, whose only incoming edge is from 
the source, and so v 2 = 4 and 4 becomes a marked vertex. Now all incoming edges to 
both 3 and 5 are from marked vertices and the last such edges (built-in order comes into 
play) are 4-^5 and 4-^3 putting vertices 3, 5 in the order 5, 3. So v 3 = 5 and t> 4 = 3. 
Finally, v 5 = 2. This proves the first assertion. By construction of the vs, relabeling each 
interior vertex i with the subscript of its corresponding v produces an automaton in Bk(n) 
with the "last occurrences increasing" property and is the only relabeling that does so. 
The example yields 

111222333444555\ 
526434556666666/ 

□ 

Now let Cfc(n) denote the set of canonical SAF automata in Bk(n) representing un- 
labeled automata; thus \Ck(n) \ = j^zjybk(n). Henceforth, we identify an unlabeled au- 
tomaton with its canonical representative. 



3 Column- Marked Subdiagonal Paths 

A subdiagonal (k, n,p)-path is a lattice path of steps E = (1, 0) and N = (0, 1), E for 
east and N for north, from (0,0) to (kn,p) that never rise above the line y = \x. Let 
Ck{n,p) denote the set of such paths. For k > 1, it is clear that Ck(n,p) is nonempty only 
for < p < n and it is known (generalized ballot theorem) that 

1 _ , \ 1 kn — kp + 1 fkn + p + I s 
\Ck{n,p)\ 



kn + p + 1 \ p 

A path P in Cfc(n, n) can be coded by the heights of its E steps above the line y — — 1; 
this gives a a sequence (h)^ subject to the restrictions 1 < 61 < b 2 < . . . < bk n and 
h < \i/k~\ for all i. 
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A column-marked subdiagonal (k, n,p)-path is one in which, for each i e [1, kn], one of 
the lattice squares below the ith E step and above the horizontal line y = — 1 is marked, 
say with a ' * '. Let C k (n,p) denote the set of such marked paths. 



(8,4) 



(0,0) 
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y = -i 
A path in C* 2 (A, 3) 

A marked path P* in C k (n,n) can be coded by a sequence of pairs ((ajj&j))" where 
(&i)i=i is the code for the underlying path P and a, G [1, 6j] gives the position of the * in the 
ith column. The example is coded by (1, 1), (1, 1), (1, 2), (2, 2), (1, 2), (3, 3), (1, 3), (2, 3). 

An explicit sum for | C k (n,n) \ is 

| C k (n,n) | = ^2 b x b 2 ...b kn , 

l<6 1 <6 2 <---<fe/sn, 

hi < [i/^l f° r a U * 

because the summand 6162 • ■ • bk n is the number of ways to insert the ' * 's in the underlying 
path coded by (h)^. 

It is also possible to obtain a recurrence for | C k (n,p) \ , and then, using Prop. 1, to 
show analytically that | C k (n,n) \ = |C fc+1 (n) |. However, it is much more pleasant to 
give a bijection and in the next section we will do so. In particular, the number of SAF 
automata on a 2-letter alphabet is 

\C 2 {n) I = I C x {n,n) \ = ^ b x b 2 . . . b n = (1, 3, 16, 127, 1363, . . .) n > x , 

l<6l<6 2 <-<6n 
bi < i for all i 
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sequence A082161 in [3]. 



4 Bijection from Paths to Automata 

In this section we exhibit a bijection from C k (n, n) to Ck+i(n). Using the illustrated 
path working example with k = 2 and n = 4, 



(0,0). 
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first construct the top row of a two-line representation consisting of k + 1 each Is, 2s, 
. . . , n s and number them left to right: 



1 2 3 4 5 6 7 8 9 10 11 12 

111222333444 



The last step in the path is necessarily an N step. For the second last, third last,. . . N steps 
in the path, count the number of steps following it. This gives a sequence %\, 12, ■ ■ ■ , i n -i 
satisfying 1 < i\ < z 2 < . . . < % n -\ and ij < (k + l)j for all j. Circle the positions 
ii, %2, . . . , i n -i in the two-line representation and then insert (in boldface) 2,3, ... ,n in 
the second row in the circled positions: 

©2 3 4©6 7 8 ® 10 11 12 

111222333444 



These will be the last occurrences of 2, 3, . . . , n in the second row. Working from the last 
column in the path back to the first, fill in the blanks in the second row left to right as 
follows. Count the number of squares from the * up to the path (including the * square) 
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and add this number to the nearest boldface number to the left of the current blank entry 
(if there are no boldface numbers to the left, add this number to 1) and insert the result 
in the current blank square. In the example the numbers of squares are 2,3,1,2,1,2,1,1 
yielding 

(©2 3 4©6 7 8 © 10 11 12 \ 
111222333444 
24533545455 / 

This will fill all blank entries except the last. Note that * s in the bottom row correspond 
to sink (that is, n + 1) labels in the second row. Finally, insert n+1 into the last remaining 
blank space to give the image automaton: 

111222333444\ 
245335454555/ 

This process is fully reversible and the map is a bijection. □ 



5 Evaluation of det Ak(n) 

For simplicity, we treat the case k — 1, leaving the generalization to arbitrary k 
as a not-too-diflicult exercise for the interested reader. Write A(n) for Ai(n). Thus 
A(n) = ( [gt. 1 -] ) • From the definition of det A{n) as a sum of signed products, we 

V 1 3 ) l<i,j<n 

show that det A(n) is the total weight of certain lists of permutations, each list carrying 
weight ±1. Then a weight-reversing involution cancels all —1 weights and reduces the 
problem to counting the surviving lists. These surviving lists are essentially the codes for 
paths in Cl(n,p), and the Main Theorem follows from §4. 

To describe the permutations giving a nonzero contribution to det A(n) = sgn a x 
n" =1 Cbi,aU) , define the code of a permutation a on [n] to be the list c = (q)" =1 with 
Cj = a(i) — (i — l). Since the entry of A(n), [^jl > * s unless j > i — 1, we must have 
a(i) > i — 1 for all i. It is well known that there are 2™" 1 such permutations, corresponding 
to compositions of n, with codes characterized by the following four conditions: (i) Q > 
for all i, (ii) c± > 1, (iii) each q > 1 is immediately followed by q — 1 zeros in the list, 
(iv) Y^i=i °i = n - Let us call such a list a padded composition of n: deleting the zeros 
is a bijection to ordinary compositions of n. For example, (3,0,0,1,2,0) is a padded 
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composition of 6. For a permutation a with padded composition code c, the nonzero 
entries in c give the cycle lengths of a. Hence sgna, which is the parity of u n — # cycles 
in a", is given by (— i)# 0smc . 



where the sum is restricted to padded compositions c of n with < i for all i (A002083) 



Henceforth, let us write all permutations in standard cycle form whereby the smallest 
entry occurs first in each cycle and these smallest entries increase left to right. Thus, 
with dashes separating cycles, 154-2-36 is the standard cycle form of the permutation 
(526143)- We define a nonfirst entry to be one that does not start a cycle. Thus the 
preceding permutation has 3 nonfirst entries: 5,4,6. Note that the number of nonfirst 
entries is only for the identity permutation. We denote an identity permutation (of any 
size) by e. 

By definition of Stirling cycle number, the product in (2) counts lists (jTi)2=i °f permu- 
tations where 7Tj is a permutation on [i + 1] with i + 1 — Cj cycles, equivalently, with Cj < i 
nonfirst entries. So define C n to be the set all lists of permutations 7r = (7fi)f =1 where 7Tj 
is a permutation on [i + 1], # nonfirst entries in 7Tj is < i, 7i"i is the transposition (1,2), 
each nonidentity permutation 7Tj is immediately followed by q — 1 e's where q > 1 is the 
number of nonfirst entries in 7Tj (so the total number of nonfirst entries is n). Assign a 
weight to 7T G C n by wt(vr) = (-l) #£ ' si ". Then 



We now define a weight-reversing involution on (most of) C n . Given ir G C n , scan the 
list of its component permutations tti = (1, 2), 7r 2 , 7r 3 , . . . left to right. Stop at the first 
one that either (i) has more than one nonfirst entry, or (ii) has only one nonfirst entry, b 
say, and b > maximum nonfirst entry m of the next permutation in the list. Say ir^ is the 
permutation where we stop. 



We have det A(n) = J2a s g n a Vii=i a i^{i) = E CT s g n ° FEU [ 2 i-a(i)] > and so 




because 



l+t-d = UIlleSS °i ^ % - 
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In case (i) decrement (i.e. decrease by 1) the number of e's in the list by splitting iik 
into two nonidentity permutations as follows. Let m be the largest nonfirst entry of %k 
and let I be its predecessor. Replace iik and its successor in the list (necessarily an e) by 
the following two permutations: first the transposition (£, m) and second the permutation 
obtained from ir^ by erasing m from its cycle and turning it into a singleton. Here are 
two examples of this case (recall permutations are in standard cycle form and, for clarity, 
singleton cycles are not shown). 



and 
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The reader may readily check that this sends case (i) to case (ii). 

In case (ii), irk is a transposition (a, b) with b > maximum nonfirst entry m of itk+i- in 
this case, increment the number of e's in the list by combining and Hk+i into a single 
permutation followed by an e: in 7r&+i,,& is a singleton; delete this singleton and insert b 
immediately after a in ftk+i (in the same cycle). The reader may check that this reverses 
the result in the two examples above and, in general, sends case (ii) to case (i). Since the 
map alters the number of e's in the list by 1, it is clearly weight-reversing. The map fails 
only for lists that both consist entirely of transpositions and have the form 



(ai,6i), (a 2 ,b 2 ) 



^ni bn) 



with b\ <b 2 < . . . <b n . 



Such lists have weight 1. Hence det A{n) is the number of lists ((dj, satisfying 
l<cii<bi<i + l for 1 < i < n, and b\ < b 2 < • . . < b n . After subtracting 1 from each 
bi, these lists code the paths in C 1 (n, n) and, using §4, det A(n) = \ C 1 (n, n) \ = \ C 2 {n) \. 
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